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This paper presents a variational problem on the Stiefel manifold of orthogonal n frames in TV 
dimensional real Euclidean space, and its corresponding optimal control counterpart. Solutions 
to the variational and optimal control problems are obtained, and some of their geometric and 
analytic properties studied. This is an extension of earlier work by Bloch et al (2002) on the 
symmetric representation of the rigid body equations (which correspond to the extreme case 
of n — N) on the Cartesian product SO(N) x SO(N). We characterize the space of solutions 
of the optimal control problem and the nature of the geodesic flows on the Stiefel manifold. 
The discrete version of this problem has been analyzed in the seminal work by Moser and 
Veselov (1991). Theorem 4 of Moser and Veselov (1991) gives a set of isospectral deformations 
for the discrete geodesic flow, which can be viewed as a discrete analogue of the parameter- 
dependent Lax representation. Recent results by Bolsinov and Jovanovic (2004) demonstrate 
that bi-invariant geodesic flows on Stiefel manifolds are integrable for a S^AQ-invariant met- 
ric. However, integrability has not yet been demonstrated for geodesic flows on general Stiefel 
manifolds with left-invariant metrics. 

We present a generalization of the Lax pair form for the equations of motion and show 
how this reduces to the classical Lax pair form in the case of the rigid body equations and the 
geodesic flow on the ellipsoid. The integrability of the rigid body equations by a Lax pair for- 
mulation with parameter had been shown by Manakov (1976); Mischenko and Fomenko (1978) 
showed that a similar formalism exists for any semisimple Lie group. Further references on 
parameter-dependent Lax pair formulations for integrable systems are given in Fedorov (1995). 
The geodesic flow on an ellipsoid, which corresponds to the n = 1 case, and its integrability 
have been treated by Moser and Veselov (1991), Knorrer (1980), and others. The paper by 
Moser and Veselov (1991) on the discrete variational version of this problem also gave a dis- 
crete Lax pair formulation with parameter, thereby demonstrating that the discrete geodesic 
equations on the Stiefel manifold are indeed integrable. 

We describe here how to obtain geodesic flows using the maximum principle of optimal 
control theory (see Bloch et al (2003), Gelfand and Fomin (2000), and Kirk (2004)). We show 
how to relate this optimal control formulation to the form naturally derived from variational 
calculus. We also relate these extremal flows to the Hamiltonian flow using the natural 
symplectic structure on the cotangent bundle of the Stiefel manifold. The extremal flows 
obtained here for the Stiefel manifolds are similar to the Hamiltonian flows on the "extended 
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Stiefel varieties" as described in Federov (2005). Finally, we demonstrate that the natural 
symplectic manifold carrying the extremals of the optimal control problem is symplectomorphic 
to the cotangent bundle of the Stiefel manifold. In the next section, we pose our problem on 
a general Stiefel manifold, and give the extremal flows obtained in the limiting cases of the 
sphere/ellipsoid (n = 1), and the N dimensional rigid body (n = N). Section 3 presents the 
extremal solution to the variational problem for the general case, and connects this solution to 
the extremal flow on the cotangent bundle, given by equation (j4T)|) . Section 4 gives the extremal 
solution of the optimal control problem restricted to a symplectic submanifold of R nN x R nN 
that has the dimension of the cotangent bundle. This section also gives the correspondence 
between this extremal solution and the extremal solution of the variational problem. Section 
5 presents the structure of the tangent and cotangent bundles of the Stiefel manifold, and 
establishes a symplectomorphism between the manifold carrying the extremal solutions of the 
optimal control problem, and the cotangent bundle. Section 6 presents a few applications 
and some future research issues of interest regarding geodesic flows on Stiefel manifolds, while 
Section 7 presents some concluding remarks. 

2. Background and Limiting cases 

We introduce the variational and optimal control problems on a Stiefel manifold in this section, 
based on minimizing the time integral of the kinetic energy. The metric on the manifold is 
given by the kinetic energy expression. We also give the extremal flows obtained in the limiting 
cases of the sphere/ellipsoid (n = 1), and the N dimensional rigid body (n = N). The extremal 
flows in these cases are well-known and integrable, and have been given in several earlier works 
such as Knorrer (1982), Moser (1980), and Bloch et al (2002). 

2.1. Variational and Optimal Control Problems on a Stiefel Manifold 

The Stiefel manifold V(n, N) C M nAr consists of orthogonal n frames in N dimensional real 
Euclidean space, 



V(n, N) = {Qe W 



nN 



QQ 



In}- 



Introduce the pairing in W s given by 

(A } B) = Tr(A T B) 1 
where Tr(-) denotes trace of a matrix and the left invariant metric on R' 



given by 



(1) 



((W 1 ,W 2 )) = (W 1 A,W 2 ) = (W 1 ,W 2 A), 
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where A is a positive definite N x N diagonal matrix. The pairing (|1J) was used in Ratiu (1980) 
as a positive definite bilinear form on so(n). We are interested in the variational problem given 
by: 

min / h(Q,Q))dt (3) 
Qv) Jo z 

subject to: QQ T = I n , Q e R nN , 1 < n < N, Q(0) = Q , Q(T) = Q T , I n denotes the n x n 

identity matrix. This is a variational problem defined on the Stiefel manifold V(n,N). The 

dimension of this manifold is given by 

^ Tr / i „ T n(n + l) ,, T . n(n — 1) 
Dim V(n, N) = nN v — ; = n(N - n) + -^-^ — '-. 

The corresponding optimal control problem is given by: 

min f h{QU,QV))dt (4) 
u y) Jo z 

subject to: Q = QU; QQ T = I n , Q(0) = Q , Q(T) = Q T where U G so(N). Note that the 
quantity to be minimized is invariant with respect to the left action of SO(n) on V(n, N) since 
the metric (0) is left invariant. 

2.2. The Rigid Body equations 

For the special case when n = N, V(N,N) = SO(N) and the extremal trajectories of the 
optimal control problem (0J) give the iV-dimensional rigid body equations. The usual system of 
rigid body equations on T*SO(N) are 

Q = QU, M=[M,U], (5) 

where M = J(U) = AU + UA is the body momentum and [■, ■] denotes the matrix commutator. 
Equations (jSJ can also be obtained directly from the variational problem (j^J in the case n = N. 
As shown by Manakov (1976) and Ratiu (1980), the Euler equations are integrable with a 
parameter-dependent Lax pair representation 

j(M + AA 2 ) = [M + AA 2 , U + AA] . 

In the optimal control approach, the costate P is a vector of Lagrange multipliers used to enforce 
the equality constraint Q = QU (see Gelfand and Fomin, 2000). The extremal trajectories for 
the optimal control problem in this case are given by 

Q = QU, P = PU, (6) 

where U G so(N) and [•, •] denotes the matrix commutator. In the symmetric representation 
of the rigid body equations given in Bloch et al (2002), the states Q and the costates P are 
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both orthogonal N x N matrices, and the extremal trajectories are on SO(N) x SO(N). For 
the symmetric representation of the rigid body equations (|Bj). U is regarded as a function of Q 
and P via the equations 

U = J~\M) eso(N), and M = Q T P -P T Q. 

The inverse of the mapping $ : (Q, P) i— > (Q, M) = (Q, Q T -P — P^Q) is given by 

P = g(e sinh " 1 f), (7) 

where sinh: so(iV) — > so(N) and its inverse are defined in Bloch et al (2002). The spatial 
momentum given by 

m = QMQ T = PQ T - QP T , 

is conserved along the flow given by (jUJ). Equivalence between (JBJ) and (J5J) was established by 
Bloch et al (2002) on the sets S and Sm where 

S ={(Q,P)eSO(N)xSO(N); \\M\\op<2}, 
S M = {(Q,M) G SO(N) x so(N); \\M\\ p < 2}, 
and ||v4||op = sup{||Ar|| : ||x|| = 1} is the operator norm. $ _1 is then well defined on Sm- 

Note that for the extremal flow to the optimal control problem given by (JBJ), one need not 

rp 

choose the costates to be orthogonal. One can choose a costate vector Pq such that Q Pq is 
skew symmetric, for example. In that case, the extremal flow is given by 

Q = QJ-\Q T P ), Po = PoJ-\Q T P ), (8) 

and these equations restrict to the invariant submanifolds defined by Q Pq G so(N). These are 
the McLachlan-Scovel equations (see McLachlan and Scovel, 1995). Comparing these equations 
with (0), we see that P = QM = P — QP^Q. In Section HI we obtain a generalization of the 
McLachlan-Scovel equations to the Stiefel manifolds V(n,N), where 1 < n < N. 

2.3. Geodesic flow on the ellipsoid 

For the other extreme case, when n — 1, we obtain the equations for the geodesic flow on the 
sphere V(1,N) = with Q = , q^q = 1. This can be also be regarded as the geodesic 

flow on the ellipsoid 

q T K- l q = 1, 
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where q = A~ l / 2 q. The costate variable P = is used to enforce the constraint q = —Uq for 

the optimal control problem (j3J) when n — 1. The extremal solutions to this optimal control 
problem are 

q = —Uq, p = —Up + Aq, (9) 

rp rp 

where A = qq UAU — UAUqq . These extremal solutions have the same form as the extremal 
solutions for the general case (when 1 < n < N) given in Section 3. The body momentum is 
obtained as 

M = qp T - pq T , (10) 

in terms of the solution (q, p) to the optimal control problem. Equations (JH1 can than be 
expressed in terms of the body momentum as 

q = -Uq, M = [M, U] - A. (11) 

The body momentum can also be expressed in terms of the solution to the variational problem 
(q, q), which we write in the form (q, s), where s = Aq = —AUq, as 

M = qs T - sq T = qq T UA + AUqq T . 

Pre- and post-multiplying the above expression with A -1 , we obtain 

A^MA" 1 = A- l qq T U + Uqq T A- 1 . (12) 

Since U is skew-symmetric, q^Uq = 0, and post-multiplying both sides of equation (fT2"|) with 

the vector q, we get 

A- l MA~ l q = Uq(q T A- 1 q) Uq = A ^ MA - q. (13) 

q L A l q 

Note that equation (fT3j) specifies U upto the equivalence class 

[U], V e[U}^Uq = Vq. 

Equation (|T^I gives our choice for U as 
_ A^iWAr 1 
q T A~ 1 q 

From this expression for U, we obtain 

UAU = -^—A- 1 {p T A- 1 q(qp T + pq T ) - (g T A~ 1 g)pp T - (p T A- 1 p)qq T ^) A~\ 
and 



where 

A = (p T A- 1 q) 2 - (p T A- 1 p)(q T A- 1 q). 
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This gives us A in equation JSJ) as 

A = [qq T , UAU] = - T i— - ( gg T A -i - A- x gg T )A. (14) 

Equations 0, (|13|) and (|14|) defines extremal trajectories of the optimal control problem on 
V(1,N) = § N -\ 

The Lagrangian (variational) formulation for this problem gives us the equations for the 
geodesic flow on the sphere. To obtain these equations, we take reduced variations (see Marsden 
and Ratiu, 1999) on V(l, N) = S^ -1 . The equation of motion can be written as 

Aq = bq, (15) 

where b is a real scalar in this case. We determine b from the constraint q q = 1. Differentiating 
this constraint with respect to time twice, we get 

T T 

q q + q q = 0. 

Substituting for q from equation (JT5|l into the above expression, we get 

•T • 

b= q q 



q T A- l q 

Thus, we get the Lagrangian (variational) equations for the geodesic flow on the sphere (S 7 ^ 1 ) 
as 

•T • 

Q_Q A -i 

q = — A q. (16) 

q 1 A~ l q 

Integrability of these extremal flows were proven by Jacobi with relation to the Neumann 
problem of motion on the sphere with a quadratic potential, as shown by Knorrer (1982). 
Contemporary version of integrability of the geodesic flow on an ellipsoid was demonstrated 
by Moser (1980) using the Theorem of Chasles and geometry of quadrics. We now obtain a 
Manakov Lax pair formulation for this system. In this case, we have 

AU — UA — (MA' 1 - A- l M)/q T A- l q. 

Using the body momentum equation in ([lip, we now get 

jM(X) = [M(\\U(\% (17) 

where 

M(X) = M + XA + qq T \- 1 A/q T A- 1 q, (18) 
17(A) = U - XA- 1 /q T A- 1 q. (19) 

From (|17p. we see that the coefficients of A- 7 in the expansion of ^tr(M(A)) fc are conserved 
along the geodesic flow. 
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Now we look at the solution to the general problem (jSJ) posed on the Stiefel manifold V(n, N) 
where 1 < n < N . The Lagrangian for this variational problem is 

£(Q, Q) = \((Q,Q)) = \tt(aq t q), (20) 

where Q G V(n,N), i.e., Q satisfies QQ^ = I n . 
3.1. Solution to the Variational Problem 

The solution to the variational problem (JHJ) posed on V(n, N) is given by the following result. 
Theorem 1. The variational problem fjj on the Stiefel manifold V(n,N) has the solution: 

Q = QU, Ueso(N), (21) 
Q T QUA + KUQ T Q = AU 2 Q T Q - Q T QU 2 A. (22) 

Proof: We take reduced variations on the Stiefel manifold given by QQ = I n ; this ensures 
that the variation vector field is always locally tangent to the manifold. The reduced variations 
are given by 

5Q = QZ, 5Q = QZ + Q±, (23) 

where QS G TqV(n, N) and £ is an iV x iV skew-symmetric matrix. These variations have 
fixed end points, i.e., S(0) = S(T) = 0. The kinematic expression (|2~T]) is obtained easily from 
the constraint QQ^ = I n . On taking the first variation of the integral quantity in (jSJ) and 
setting it to zero, we obtain 

T 

{(5QA,Q) + (QA,5Q)}dt = 0. 



o 



On carrying out integration by parts using fixed end point variations, the above expression 
simplifies to 

{(QSA,Q) + (gA,QS)}dt = 

<=> I (QA,QE)dt =0 
Jo 

Tr(AQ T Q£) = 

AQ T Q - Q T QA = 0, (24) 

i.e., AQ^-Q is symmetric. Taking the time derivative of Q in (j21j) . we get 
Q = QU 2 + QU. 
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Substituting this for Q in equation ({231 gives the solution of the variational problem as given 
by equation ((221) • 1=1 
Note that the flow on the Stiefel manifold induced by equation (|22j) is invariant to variations 
within the equivalence class [U] C so(N), defined by 

V G [U] QV = QU. (25) 

Since QQ^ = I n , the left-multiplication of equation (|2*2*Jl by Q gives 



QUAQ T + QAUQ T = Q[A,U 2 ]Q T . (26) 

rp 

Here W = UA + AU G so(iV) is a skew-symmetric matrix that also satisfies QWQ = 
Q[A,U 2 }Q T . Equation © 

is a statement of the conservation of the quantity 

m = QUAQ T + QAUQ T = SQ T - QS T , (27) 

where S = QUA = QA, along the flow of the vector field given by equations (|2*T|) - (|2*2*jl . Note 
that 

m = QWQ T - Q[A, U 2 }Q T = 

by equation ()2Kj1 . The quantity m is analogous to the spatial momentum of the rigid body, 
which is conserved along the flow of the rigid body equations. The equations (|2*T]) - (f2"2"J) can also 
be written as 

Q = QU = SA-\ Ueso(N), (28) 
Q T S - S T Q = 0. (29) 

The above formulation of the variational problem, equations (|28 |l -(|29 jl in particular, do not 
give an implicit or explicit equation for the rate S. To obtain such an expression, we need the 
following result. 

Proposition 1. Define the linear map Lq : M. nN — > 1^, given by 

L Q (X) = Q T X-X T Q, QeV(n,N). (30) 

If X G M. nN is in the kernel of this map, then X is of the form 

X = BQ, B = B T . (31) 
Proof. If X is in the kernel of the Lq defined by (|30|). then X = X^-Q and hence X 

rp rp rp 

is symmetric. Let us denote the rows of Q by the orthonormal vectors q } , g 2 , . . ., g„ . We 
complete a right-handed orthonormal basis of R from this set of orthonormal vectors; the 

rp rp 

additions to this set are denoted by the vectors q„ +1 , ■ ■ ■ , qf;- The vectors of this orthonormal 
basis are arranged in columns to form the orthogonal matrix 

C=[qi q 2 ■■■ q n q n +i ••• qN]^SO(N). 
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rp 

We express C in the block diagonal form 
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(32) 



C 



T 



C\ C3 
d d 

where the diagonal blocks C\ and d are n x n and (iV — n) x (AT — n) matrices respectively. 
Hence, from equation (J32J), we get 



g = [4 o]c T = [d c 3 ). 

We apply the same transformation to X e !R nAr to get 

Y = XC= [Fx Y 2 ], (34) 

where Y x e R nn and Y 2 e R n(N - n) . Since Q T X = CR T YC T , the relation Q T X = X T Q can 
be expressed as 

R T Y = Y T R & Yx = Y? = B, Y 2 = 0. (35) 

Then we have 

C\ C3 

C*2 C4 
= [fid fid] 

= fi[d d] = 

using the expression in equation for Q. This gives us the result in equation (f3~Tj) . where 



(33) 



X = [Yi 0] 



fi = Y : 1 — Yi is a symmetric n x n matrix. 



□ 



Since from (J2HJ), S 1 is in the kernel of Lq, using the above result we get 

S = BQ, B = B T . (36) 

The following lemma is necessary to obtain an expression for the flow of the quantity S and 
leads to an implicit equation for fi. 

Lemma 1. If K is symmetric and positive definite and B is symmetric, the map J : Sym(n) — ► 
Sym(n) given by J : fi 1— > Xfi + fii^ /ias kernel zero, and is hence an isomorphism. 

Proof. Since fi is symmetric, there exists an orthogonal matrix C, such that CBC = 
where \1/ is a real diagonal matrix. Thus if KB + BK = 0, then 

C T KC^ + mC T KC = 0, 
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where C T KC is a positive definite symmetric matrix. If e is an eig envector of K = C T KC, 
then 

Ke = Ae, A > 0, 

and 

+ = K(9e) + A(tfe) = 0. 

Hence \I/e is also an eigenvector of K with eigenvalue —A < 0. But K is positive definite and so 
all its eigenvalues are strictly positive. Thus, we have a contradiction, unless \1/ = and hence 
B = 0. □ 
Here Sym(n) denotes the space of n x n symmetric matrices. Now we state the main result 
of this subsection, which expresses the geodesic flows on the Stiefel manifold in terms of the 
quantities Q and S = QUA; the pair (Q, S) can be used to parametrize the tangent bundle 
since S = Qhr 1 . 

Proposition 2. The geodesic flow on the Stiefel manifold given by the variational problem (0) 
is of the form 

Q = SAT 1 , S = BQ, (37) 



where 



and 



B = J Q \e), E = -2QQ T = -2SA- 2 S T , (38) 



J Q (B) ± (QA- l Q T )B + B(QA- l Q T ); J Q : Sym{n) -> Sym{n). (39) 

Proof. The proof of this result makes use of the simple observation that 
QA~ 1 S T = -QUQ T 

is skew-symmetric. We already know from Proposition^ and equation (|2"9j) that S = BQ where 
B is a n x n symmetric matrix. To obtain an expression for B, we take a time derivative of 

QA^S T + SA^Q 1, = 

along the extremal trajectories. This gives us 

QA- 1 Q T B + BQA^Q^ = SA^UQ 1 ^ - QUA^S 1 

=> Jq(B) = - 2QQ T = -2SA- 2 S T , 



since S = QUA. Since QA l Q^ is positive definite, we know from Lemma ^ that Jq : 
Sym(n) — > Sym(n) is an isomorphism, and hence the inverse B = J Q (2QU 2 Q l ) exists 
and is unique. This proves the proposition. □ 
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Note that the quantity S = QUA depends on the equivalence class defined by (J2o"|) ; so equations 
(p?7 ]) -pn j) in terms of (Q,S) also have the advantage that they uniquely express the extremal 
flows, whereas U in equations (|2*H) - (|2^jl is not unique. Since S = QA^ 1 , the above proposition 
also expresses the extremal flow in terms of the tangent bundle pair (Q, Q) G TV(n, N). 

Now we give another method to obtain the above geodesic flow from the constrained 
Lagrangian 

C C (Q, Q) = C(Q, Q) + l -(B, QQ T - I n ) = ±«Q, Q)) + QQ T - J n ), (40) 

where B G Sym(n) is a Lagrange multiplier matrix. We form the Hamiltonian for this problem 
by applying the Legendre transform to the Lagrangian (J20J) . which gives the same result as 
applying the Legendre transform to the constrained Lagrangian (|4f)|) . Just as TV(n, N) can 
be viewed as a submanifold of V(n, N) x W lN C W lN x R nN , T*V(n, N) can also via (•, •) be 
viewed as a submanifold of V(n, N) x ~R nN C IR nJV x W nN , and endowed with the symplectic 
structure induced from the canonical symplectic structure of M. nN x W nN . 

Proposition 3. The Legendre transform ¥£ : TV(n, N) — > T*V(n, N) is obtained from the 
fiber derivative of the Lagrangian C : TV(n, N) —> E as 

¥£(Q,Q) = S. (41) 

Proof. Let U, V G so (AT), so that QU, QV G TgV^n, AT). The fiber derivative of the Lagrangian 
(j2Uj) is given by 

¥C(Q,Q)(T) = f s \ s=0 C(Q + sT) 

= ^- s \ s=0 l((Q + sT,(Q + sr))) 

= ^«r,Q» + ^(g,r» = «g,r» 

^¥C(Q,Q)(T) = (QUA,T) = (S,T), 

or by identifying T*V(n, N) with TV(n, N) through (-, •), ¥C(Q, Q) = S. Thus, the map given 
by dHJ is the Legendre transform ¥£ : TV(n, N) -> T*V(n, AT). □ 
Proposition|3]can be used to obtain the Hamiltonian from the constrained Lagrangian as follows: 

H C (Q,S) = (^(Q,S) -C c (Q,Q)j |q =F £-i( 5 ) 

= ((SA-\ SA- 1 )) - l((SA-\ SA- 1 )) - I<S, QQ T - Iu)\q =sa -i 



-{{SA-'^A- 1 )) - ^{B, QQ T - I n ) |q =5A -i 
~ T^SA- 1 ) - i Tr(5QQ T - 5). (42) 
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The extremal (geodesic) flows are then be obtained from this Hamiltonian once the flow is 
restricted to V(n,N), as: 

Q = Grad 5 # c (Q, S) = SA" 1 , 

S = - Grad Q # c (Q, S) = BQ, (43) 

which are identical to equations (}3Tj) in Proposition^ Thus, we can parametrize TqV(h, N) 
by (Q, S), and equations (|3*Tj) then express the extremal flows in terms of this parametrization. 

In the following subsection, we define a momentum quantity that generalizes the body 
momentum of the iV-dimensional rigid body, and that can be expressed in terms of Q and 5*. 
This expression of the body momentum is then used to solve for the unique equivalence class 
[U], that corresponds to a given extremal solution pair (Q, S) of the variational problem. 

3.2. The Momentum Equation and its Solution 

Using equation (J2T| . we can write the Lagrangian in the form 

C(Q,U) = -^Ti(AUQ T QU) = ~{QU,QUA). 

We define the (body) momentum as the ^/-gradient of the Lagrangian with respect to the 
pairing (•, •) in (Q): 

M = 2Grad c/ £(Q, U) = Q T QUA + AUQ T Q = Q T S - S T Q. (44) 

We call this the body momentum, since in the case n = N (iV-dimensional rigid body), this 
quantity is the momentum expressed in the body coordinate frame. The body and spatial 
momenta quantities are related by 

m = QMQ T , (45) 

as can be verified using equations (|27J) and (}4*4"j) and the constraint QQ^ = I n . One can verify 
that with the body momentum defined as in (jUJ), equation (|2~2"j) is equivalent to 

M=[M,U]-A, where A = Q T SU + US T Q = [Q T Q,UAU]. (46) 

We show how these equations generalize the classical rigid body in N dimensions, i.e., the 
case n = N . In this case, the Euler- Arnold equations are given by 

Q = QU, M = [M, U], where M = UA + AU, 

and Q G SO(N). From equation (|25ji. we obtain in the case n = N 

UA + AU = AU 2 - U 2 A. 
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Taking a time derivative of M = UA + AU and subsituting the above equation, we see that M 
satisfies the Euler- Arnold equation. This is also equivalent to the flow of S being represented 
by (JUl). 

Now we present a solution of the algebraic equation ()44jl . which we rewrite below as 

M = Q T S-S T Q = Q T QUA + AUQ T Q = J Q (U) } (47) 

where QQ — I, U — —U , and A > (diagonal and positive definite). The map 
Jq : so(iV) — > so(N) is defined by equation (jUJ). We first present a few lemmas, which 
are necessary to prove the main result. 

Lemma 2. If K is symmetric and positive definite and X e 5o(iV) ; the map J : S0(n) — > so(n) 
given by J : X i— > KX + XK has kernel zero, and is hence an isomorphism. 

Proof. The proof of this statement is similar to the proof of Lemma ^ Since X is skew, there 

— rp — rp _ rp 

exists a unitary matrix L, i.e., LL = L L = ijv, such that LXL = zE, where S is a real 
diagonal matrix. Thus if KX + XK = 0, then 

LKL Ts L + Y.LKL 1 = 0, 

where Li^L is a positive definite Hermitian matrix. If e is an eigenvector of K = LKL , 
then 

Ke = Ae, A > 0, 

and 

£T£ + = if(£e) + A(£e) = 0. 

Hence Se is also an eigenvector of K with eigenvalue — A < 0. But K is positive definite and so 
all its eigenvalues are strictly positive. Thus, we have a contradiction, unless £ = and hence 
X = 0. □ 

Lemma 3. If J Q (U) = 0, then QU = 0. 

— rp 

Proof. From the proof of Lemma 1, we know that LUL = iT, where L is unitary. We define 
the Hermitian matrices 

R = LQ T QL T , K = LAL T . 
Then the result to be proved becomes equivalent to the following 

REK + KER = LJ Q (U)L T = RE = 0. (48) 
This is because 

RE = Q T QU = QU = 0, 
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since has full (column) rank. Thus we need to show that equation (J48|) is satisfied, to 
complete the proof. 

Note that £ is real and diagonal, K is positive definite, and R 2 = R, i.e., R is a projection 
matrix. Since R is a projection onto a subspace of dimension n, there exists a unitary matrix 
T (f T T = Tf T = Jjv), such that 





- nxn 




Ti?T T = 

















(N-n)x(N-n) . 



Relative to this decomposition, we write 



- T 
Si E 2 



- T - T 

Si = Si, £ 3 = E 3 , 



TKT T 



T T 



K x K 2 
K 2 K 3 

We may then express the equation REK + KHR = in the form 



-T ■ 
Si S2 






- T " 
K 2 






- T " 




" Si 


" 






+ 















K 3 




_K 2 


^3 _ 




_ s 2 






0. 



(49) 



Note that 



i?S = <£> Ei 



0. 



Since K is Hermitian and positive definite 

T 



= [0 x j 







> 



7-1 



K x K 2 
K 2 K 3 

— rp 

for all x^O. Hence, K 3 is Hermitian and positive definite. Also, since Z = TKT is positive 

definite, this implies that Z~ x is positive definite, and we denote 

- T 

L2 L3 

where L\ and L 3 are also positive definite (by the above argument). From the relations 

K 2 L, + K 3 L 2 = 0, K X L X + K 2 T L 2 = I n , 
we obtain L\ as 

L 1 = (K 1 -K 2 T K 3 7 1 K 2 y 1 . 
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_ rp 

Hence (K x — K 2 K% K 2 ) is also positive definite. Now equation ()49jl is equivalent to the 
following two independent equations: 

E X K X + Z 2 T K 2 + K X E X + i? 2 T S 2 = 0, 

K 2 Z X + K 3 Z 2 = 0. 

We know K3 is invertible so £ 2 = — K 3 K 2 H X and 

E X K X + (-K^K^f^ + K X Y. X + K 2 T (-K^K 2 ^ X ) = 

=> Z X {K X - K 2 T K^K 2 ) + (#! - K^K^K^ = 0. 

_ rp 

Here we note that - K 2 K^K 2 is a Hermitian positive definite matrix. Using the result 
of Lemma EJ we see that equation (|5()jl implies that £j = 0, and hence £ 2 = — ^Q" i^Ei = 0. 
Thus i?E = 0, which implies as we have shown that QU — 0. □ 
Now we present the solution to the algebraic equation (jUjl or (j%7j) . 

Theorem 2. AZZ solutions of the equation fl^7| ) have the form 

U = U 1 + U 2 + V, (50) 



where 



U x = A-\Q T (QA- l Q T )- 1 S - S T (QA- l Q T )- 1 Q)A-\ 

U 2 = A-\Q T Jq~\r)Q)A-\ (51) 



and 



R = (QA-'S^iQA-'Q 7 )- 1 - (QA- 1 Q T )-\SA- 1 Q T ), 

Jq{X) = (QA- l Q T )X + X(QA- l Q T ); Jq : so(n) -> so(n) (52) 

and QU = 0. 

Proof. We evaluate Jq(U x ) as follows 

Jq(e/i) = q t qa- 1 (q t (qa- 1 q t )- 1 s - s t (qa~ 1 q t )- 1 q) + (q t (qa- 1 q t )- 1 s 

- 5 T (gA- 1 g T )- 1 g) 

= q t s - q t (qa- 1 s t )(qa- 1 q t )- 1 q + q t (qa- 1 q t )-\sa- 1 q t )q - s t q 

— M - Q T RQ. 
Let U 2 = A~ 1 Q T XQA- 1 ; then 

Jq(U 2 ) = Q T {QA~ 1 Q T )XQ + Q T X(QA- 1 Q T )Q = Q T T Q {X)Q. 

Thus if Jq(X) = R, then Jq(U 2 ) = Q^RQ. From Lemma El we know that Jq : so(n) — > 
so(n) is an isomorphism. It follows that Jq is invertible, Jq (i?) = X is unique, and 
C/ 2 = A-^TxgA- 1 satisfies J Q {U 2 ) = Q T RQ. Thus, 

Jq(C/i + U 2 ) = J Q {U X ) + J Q (U 2 ) =M- Q T RQ + Q T RQ = M, 
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and hence all solutions of (J47j) have the form 

U=U 1 + U 2 + V, Jq(V) = 0. 

From Lemma 01 we know that Jq(V) = =>- QV = 0. Thus, we have proved this theorem. □ 
Note that this solution to the algebraic equation or f|47|) uses the decomposition of M in 
the (Q, S) variables. 

Let X = Jq (R), where R and Jq are as defined in equation (|52j) . The following lemma 
gives an expression for X in terms of Q and S. 

Lemma 4. If Jq and R are as given by equation \5$fy. then we can express 

X = Jq~\r) = {QX- 1 Q T )- 1 QX- 1 S T {QX- 1 Q T )-\ (53) 

Proof. If U is a solution of the algebraic equation (JUJ), then the quantity S can be expressed 
in terms of U of as 

S = QUA = QA- 1 {Q T (QA- 1 Q T y 1 S - S T (QA- 1 Q T y 1 Q} + Qhr x Q T XQ 
S = S- QA- 1 S T (QA- 1 Q T )- 1 Q + Qhr x Q T XQ 
=> QA- l Q T XQ = QA- 1 S T (QA- 1 Q T )- 1 Q 
^XQ = (QA- 1 Q T )- 1 QA- 1 S T (QA- 1 Q T y 1 Q. 

Post-multiplying both sides of the above expression with , we get the expression in for 
X. □ 
One can verify that Jq(X) = R, where X is as given by as follows 

Jq(X) = (QA- l Q T )X + X(QA- l Q T ) 

= (qa- 1 s t )(qa- 1 q t )- 1 + (gA- 1 g T )- 1 (gA- 1 5 T ) 
= (qa- 1 s t )(qa- 1 q t )- 1 - (gA- 1 g T )- 1 (,SA- 1 g T ) = R, 

where we used QA~ S = —QUQ is skew-symmetric in the last step above. 

The following statement is a corollary of Theorem |21 and Lemma 0] 

Corollary 1. For a given Q, the map Z : TgV(n, N) — > TqV(u, N) where Z : S i— > [£/] zs 
g'zven 6y Theorem^ is an isomorphism. 

Proof. Clearly, the map Z^ 1 : [U] ^ S = QUA is a linear isomorphism. Observe from Theorem 
Eland Lemma El that we get the following expressions for U\ and L^: 



Ui = A' 1 
U 2 = A' 1 



Q T (QA- l Q T )- l S - S T (QA- l Q T )- 1 Q 
Q T (QA- 1 Q T )- 1 QA- 1 S T (QA- 1 Q T )- 1 Q 



A" 1 , 
A" 1 , 
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which depend linearly on S. It is now easy to verify that 

QU X = [S- QA- 1 S T (QA- 1 Q T y 1 Q}A-\ 
QU 2 = [QA- 1 S T (QA- 1 Q T )- 1 Q]A-\ 

which gives us Q(U\ + U 2 ) = SA^ 1 , validating the relation S = QUA. Hence, the map 
Z : S t— > [U] is an isomorphism. □ 
Thus, Theorem El and Lemma 0] describe the exact relationship between the extremal solutions 
expressed in terms of (Q, [U]) in equations (J2TJl - (J22|l . and those expressed in terms of (Q, S) in 
equations (ETTI)-(I3H1). 

3.3. The Discrete Variational Problem 

The discrete counterpart of the variational problem Q and the discrete extremal trajectories 
obtained thereof were given by Moser and Veselov (1991). The discrete variational problem is 
given by 

min V^(Q fc+1 A,Q fc ), (54) 

k 

rp 

subject to QkQk = In- The extremal trajectories to this discrete variational problem are given 
by Moser and Veselov (1991) 

Q k+1 A + Q k -iA = B k Q k , keZ (55) 

where B k = G M. nn is a (symmetric) Lagrange multiplier matrix for the symmetric constraint 
QkQk = In- The above equation is the discrete counterpart of equation (pTTjl. The discrete 
body momentum is defined as 

M k = Qj^QkA - AQ T Q fc _i. (56) 

Since B k is symmetric, equation is equivalent to the conservation of the discrete spatial 
momentum 

m k+ i = Q k+1 AQ k - Q k AQ k+l = Q k AQ k _ x - Q k -iAQ k = m k . (57) 

rp 

Thus, m k = Qk-iM k Q^_i is conserved along the discrete extremal trajectories. Let us define 

U k = QfeiiQfe, 
which implies that 

Qk = Qk-iUk, M k = U k A - AUJ. 

The following proposition gives the discrete extremal trajectories in terms of U k and the discrete 
body momentum M k . 
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Proposition 4. The extremal trajectories of the discrete variational problem \54\j on the Stiefel 
manifold V(n,N) in terms of (M k ,U k ) are given by: 



where 



M k+1 = U k T M k U k - A k , (5* 



A k = U k T A(I N - U k T U k ) - (I N - U k T U k )AU k . (59) 



Proof: One can obtain this result from the second order difference equation (|55|) which gives 
the extremal trajectories for ()54j1 . Using this equation, we can represent the body momentum 
at the (k + l)-th step as 

M k+1 = Qj(B k Q k - Q k ^A) - (B k Q k - Q fe _iA) T Q fc 

= AU k - UJA. (60) 

From equation (|H7j) expressing conservation of the spatial momentum, we get 

QkM k+l Ql = Qk-iM k Qk_ x = Q k ujM k U k Q^ 
^M k+1 =U k T M k U k -A k where Q k A k Qj = 0. 

From the above expression, we get 

A k = M k+1 - ujM k U k , 

and now using equation ([60)1 . we obtain A k as given in equation ([59)1 . This proves the given 
result. □ 
Note that equation (jHOj) is equivalent to equations (|HHj) - (|H9*jl for the discrete body momentum. 
These equations are therefore the discrete counterpart of equation (pftjj) for the continuous case. 



Proposition 0] can also be used to prove Theorem 4 of Moser and Veselov (1991), which 
gives the following set of isospectral deformations for the discrete extremal flows 

L k+1 (X) = C fe (A)L fc (A)C fc - 1 (A), (61) 

where L k (X) = A 2 + \M k - \ 2 Q k _ l Q k „i, and C k (X) = A - XQjQ k -i- Note that L k (X) can be 



factored as 



L k (X) = (A + AQl iQfc)(A _ = C fc T (-A)C fe (A). 



Hence, the determinant of L k is an even polynomial in A of degree 2n. Using equations (f5£j) - (f59*J) . 
we can express the left hand side of equation (JBTj) as 

A 2 + X{QlQ k ^M k Ql_ x Q k - A k ) - X 2 QjQ k = A 2 + X(AQ^_ 1 Q k - QjQ^iA) - X 2 Q k Y Q k . 
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The right hand side of equation (J61j) is obtained from the factorization of Lfc(A) as follows 

C fc (A)(C fe T (-A)C fc (A))C fe - 1 (A) = C fc (A)C fc T (-A) 

= {A-XQjQ k - 1 )(A + XQj_ 1 Q k ) 

= A 2 + XiAQj^Qk - QlQk-xA) - X 2 QjQ k . 

Using this set of isospectral deformations, Moser and Veslov (1991) give a method to reconstruct 
the discrete flow under further conditions, and the discrete flow is integrable in this sense. 

4. The Optimal Control Problem 

We now study the Hamiltonian approach to the variational problem (|4"|). The Hamiltonian for 
the optimal control problem (jlj) is given by 



Tr (P T QU) + l - Tr(AUQ T QU), (62) 



H(P,Q,U) = (P,QU)-C(Q,U) 

where P G M. nN denotes the costates (Lagrange multipliers). This optimal control problem is 
nominally posed on W n> jv = 

R nN x M niV Qn 

which the symplectic structure is given by the 

symplectic form 

uj((A u A 2 ), (B u B 2 )) = (A!, B 2 ) - (A 2 , B x ). (63) 

We restrict the solutions of this optimal control problem to those which are governed by 
extremals that leave the submanifolds W^ N invariant, where the W% N are level sets of W n ^ 
specified by 

Wl N = {(Q,P)eW n , N ; QQ T = I n ; QP T + PQ T = k}, (64) 

where k is some constant symmetric n x n matrix. Note that the dimension of W% N is given 
by 

Dim W^ N = Dim V(n, N) + nN 1 — ; = 2Dim V(n, N), (65) 

which is equal to the dimension of the (co)tangent bundle of V(n, N). 

4-1. Space of extremal solutions to the optimal control problem 

Consider the vector space L of vector fields on W n> N characterized by the differential equations 
Q = QU, Ueso(N), 

P = PU + QV 1 Ve5o(N). (66) 
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Let X\ = (QUi, PUi + QVi) and X 2 = (QU 2 , PU 2 + QV 2 ) be vectors in L. This vector space is 
seen to be a Lie algebra, since: 

[X U X 2 ] = [(QU 1 ,PU 1 + QV 1 ),(QU 2 ,PU 2 + QV 2 )] 

= {Q[U U U 2 ],P[U U U 2 ] + Q[V U U 2 ] +Q[U l , V 2 ]), 

also belongs to L. We now show that the submanifolds W* N are integral manifolds to the 
involutive distribution of vector fields in the Lie algebra L. 

Lemma 5. The submanifolds W^ N C W n ^ are integral submanifolds to the involutive 
distribution on W n ^ defined by the Lie algebra of vector fields, L. 

rp rp rp 

Proof: Differentiating the constraints QQ = I n and PQ + QP = k, along trajectories of 
vector fields in L defined by the system we get: 

QUQ 1 - QUQ T = 0, 

(PU + QV)Q T - PUQ T + QUP T + Q(-UP T - VQ T ) = 0. 

This shows that all vector fields in L are tangent to each of the submanifolds W^ N of 
W n , N = R nN x R nN . We find the dimension of the subspace L(Q,P) of T^q,p)W^ n spanned by 
the vector fields in L. Since U and V are independent 

Dim {{QU,PU + QV), (Q,P)eW£ )N , U,V E so(N)} 
= Dim {(QU, PU), U G so(N)} + Dim {(0, QV), V G so(N)} 
= Dim {(QU, 0), U G so(N)} + Dim {(0, QV), V G so(N)} 
= 2Dim {QU, U G so(N), QQ T = I n } = 2Dim V(n, N), 

since the dimension of the tangent space to V(n, N) at Q is the same as the dimension of 
V(n,N) itself. From (|65jl . we see that this is also the dimension of W^ N , and hence that of 
T(Q,p)W£ N . Thus, we conclude that at every (Q,P) G W^ N , 

L(Q,P)=T (Q>P) Wl N . 

This proves the given result. □ 

Now that we have shown that the W% N are integral submanifolds of the Lie algebra L, we 
next state and prove the following important result. 

Theorem 3. The space W^ N defined by is a symplectic submanifold of W n ^ = {R nN x 



A variational problem on Stiefel manifolds 22 

Proof: To show this result, we need to show that the symplectic form u on W n ^ given by 
(jnHJ), is non-degenerate on W^ N when restricted to W£ N . Let us denote the restriction of the 
symplectic form on W n ^ to W^ N by 



ft — UJ\ 

Tangent vectors to W% N are given by (jHEJ)- Thus we must show that 

n((QU u PUt + QVt), {QU 2 , PU 2 + QV 2 )) = 

for all (U 2 ,V 2 ) G so(N) x so(N) implies that (QU U PU X + QVL) = (0,0). By definition of Q, 
we have 

(QU U PU 2 + QV 2 ) - (QU 2 , PU X + QVi) = 0, V (U 2 , V 2 ) e so(iV) x so(iV). (67) 
Setting P 2 = 0, we get 

Ti(U 1 Q T QV 2 ) = 0, V V 2 . 
Since V 2 is skew-symmetric, this in turn implies that U\Q Q is symmetric, or 

QU 1 Q T Q = -QU U since QQ T = I n . 

Thus, each row of QU\ is an eigenvector of Q Q with an eigenvalue of —1. Since IS 
positive semi-definite, this is a contradiction unless QU\ = 0. Now setting QU\ = we have 
from (|H7j) that 

(QP 2 , PU X + QFi) = Tr{U 2 Q T {PU 1 + QV X )) = 0, V P 2 . 
Since P 2 is skew-symmetric, this implies that 

Q T (PU 1 + QV^ = - (U 1 P T + V l Q T )Q 

^PU 1 Q T + QV 1 Q T = - QUiP T - QViQ T 

QViQ T = - QViQ T since QU X = 

=*► QViQ T = 

=*► q t (pu 1 + qv 1 )q t = o = -(PiP T + V!g T )gg T 

+ QV^ = 0. 



Thus, (f2, H^jv) for any value of k is a symplectic submanifold of (a;, W n ,jv)- □ 
We restrict the extremal flows of the optimal coi 
for a value of k given by the initial conditions. 



We restrict the extremal flows of the optimal control problem to the symplectic manifolds W* N , 
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4-2. Solution to the Optimal Control Problem 

The extremal solutions on W^ N to the optimal control problem are characterized by the 
following result. 

Theorem 4. The extremal trajectories of the optimal control problem 01), (Q,P) € W% N) are 
given by 

Q = QU, Ueso(N), (68) 
P = PU - QA, (69) 

where A = Q T QUAU - U\UQ T Q. 

Proof. The Hamiltonian (jSZj) can be written in the alternate form 

H(P, Q, U) = (P, QU) + ^(Q T Q, UAU) (70) 

This optimal control problem may be restricted (with possible loss of generality) so that 
extremal trajectories lie on the symplectic manifold (W^ N ,Q), where fl = uj\ W k is the 
symplectic two-form given by 

Q((Q 1 ,P 1 ),(Q 2 ,P 2 )) = {Q U P 2 ) - (Q 2 ,P 1 ). (71) 

Hence, we have 

dH(X 2 ) = n(X 1 ,X 2 ), (72) 

where X± = (Qi, Pi) and X 2 = (Q 2 , P 2 ) are vector fields such that Qi = QU, Pi = PUi — QVi 
i = 1, 2, Ui, Vi E so(N), and X 1 = X H . Thus, from (Jl2j), we get 

(PU 2 - QV 2 , QU X ) - (PC/x - QV U QU 2 ) 
= (PU 2 -QV 2 ,QU) + (P,QU 2 U) + ^([Q T Q, f/ 2 ],f/Af/>. 

Setting U 2 = in the above expression, we get QU\ = QU. Thus, from above, we get 

(PU 2 ,QU)-{PU-QV h QU 2 ) 



So 



(PU 2 ,QU) + (P,QU 2 U) + ^([Q T Q,U 2 ],UAU). 



(QVt,QU 2 ) = ~([Q T Q,U 2 ],UAU) 

= ^(Q T QU 2 -U 2 Q T Q,UAU) 

= ^(QU 2 ,QUAU) + ^(QU 2 ,QUAU) 
= (QU 2 ,QUAU). 
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But 

(QU 2 ,QUAU) = (QU 2 ,Q(Q T QUAU -UAUQ T Q)), 

rp 

since QQ — I n and 

(QU 2 ,QUAUQ T Q) = (QU 2 Q T ,QUAUQ T ) = 0, 

as U 2 is skew-symmetric and QUAUQ is symmetric. Hence, we get V\ = [Q Q, UAU] up to 
equivalence, and Xh is the vector field given by 

Q = QU, P = PU - QA, where A = [Q T Q, UAU}. 

Thus, the Hamiltonian vector field Xh prescribes the flow given by equations (pT%J) - ()69j) . □ 

We determine the optimal control U applying Pontryagin's maximum principle (see Bloch 
et al (2003), Gelfand and Fomin (2000) and Kirk (2004)). The Hamiltonian in © can also 
be expressed as 

H{Q,P,U) = Ti^(P T Q-Q T P)U+^TtQ T QUAU. (73) 

Then Gradf/if (Q, P, U*) = with respect to the pairing in ((TJ) is equivalent to 

Q T P - P T Q = Q T QU* A + AU*Q T Q = M, (74) 

where U* is the optimal control. This equation gives the momentum M in terms of the states 
Q and costates P and also in terms of Q and U* (or Q and S), as given by the solution to 
the variational problem in equation ([44)1 . We now appeal to Theorem |21 to give an explicit 
representation for U* in terms of (Q, P), by replacing S with P. 

Taking the time derivative of the equation M = Q P — P Q along the vector field given 
by equations we get 

M = [M, U] - A, (75) 

which is identical to equation obtained from the solution to the variational problem. 

omce QAQ T = 0, one can right multiply (jHSJ) with to get 
PQ T = PUQ T , 

which implies ffrom I55j) that A = PQ^ is conserved. Hence the symmetric quantity 

k = PQ T + QP T = A + A T , (76) 

is conserved along the trajectories of (|58 j) - (|59j) . Since the spatial momentum is 

m = QMQ T = PQ T - QP T = A - A T , (77) 
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as originally defined in (J27)) . this quantity is also conserved along the flow of the extremal 
solution (|f)8 )l -(|f)9 |l to the optimal control problem. Hence, for each initial condition set, the 
solution trajectory is confined to the level set 

W£ = {(Q,P) G W n , N ; QQ T = I n ; QP T + PQ T = k, PQ T -QP T = m}, (78) 

which is a submanifold of W^ N . The manifold W* N is formed by an union of the over all 
values of m. Clearly, the Hamiltonian vector fields given by ([BH )) -(p9* ]) are tangent to the 
which are level momentum sets of this Hamiltonian flow. Note that PQ = A = ^(k + m) is 
constant for (P, Q) G C W^ N . 

The quantity that corresponds to the body momentum in the full-ranked case, is according 

to flZU 

M = Q T P - P T Q, (79) 

which has been previously defined in equation (jUj) in terms of the solution to the variational 
problem. In the case n = N we have obtained Bloc et al (2002) the following relation when 
||m|| < 2 

For the general case, 1 < n < N, when PQ = A is orthogonal and ||m|| < 2, the extremal 
solutions (|U8* j) -(|U9* j) in C W^ N can be expressed as 

QeV{n,N), P = e sinh "^Q + QR, 
where k = + c, c = e~ smh 1 t , and 

R G so(N), QRQ T = 0, R = [R, U] - A. 
This can be easily verified by direct substitution into equations 

Note that the quantity k specifies the symplectic submanifold on which the extremal 
solution lies, while the spatial momentum m specifies the momentum level set. We want 
to express the costates P of this optimal control problem in terms of Q and the momentum 
quantities m and M, given by (|77|) and (|79j) respectively. Note also that if (Q, P) is an extremal 
trajectory to the optimal control problem, i.e., 

Q = QU, P = PU-QA, (80) 

then (Q, P + hQ), where h is a constant n x n matrix, is also an extremal trajectory satisfying 
(IHOl . If (Q, P) lies in the solution submanifold W* N given by (JHU), then (Q, P + hQ) G W 1 



k+b 
n,N 



where b = h + h . Further, if h is symmetric and {Q,P) lies in the momentum level set 
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C W% N , then (Q,P + hQ) lies in the momentum level set W^~ 2h C W^ h ; i.e., the spatial 
momentum value remains unchanged. One can verify that in this case, the body momentum 
value M also remains unchanged. This brings us to the following remarkable result. 

Proposition 5. We have a map E : W^ N — > W° N , defined by E : (Q, P) h-> (Q, P ) G W° n , 
where 

P Q = P-\kQ, (Q,P)eW£ N . (81) 

Further, if (Q,P) G C W* N , then (Q,Pq) G W 7 "^ C W° )JV; i.e., it satisfies equations 
and 

PoQ T - QP T = m =PQ T -QP T , 

Q T P - P T Q = M = Q T P- P T Q. 
Hence the map E leaves the spatial and body momenta unchanged. 
Proof. Let (Q, P) G W* N . Then we have 

PoQ T + QPj = (P~ \kQ)Q T + Q(P - l -kQ) T 

= PQ T ~\k + QP T ~\k 
= k - k = 0. 

Thus (Q, P) is mapped to (Q, P ) G W° )JV by S. If (Q, P) G W£, then PQ T - QP T = m in 
addition to + QP = fc, and we have 

PoQ T - QPj = (P- \kQ)Q T - Q(P - \kQ) T 
= PQ T -h-QP T + l -k 
= PQ T - QP T = m. 

This proves the second part of the statement. □ 

The following result is a corollary of Proposition El and gives the costate P = Po as a 
function of Q, m and M, such that (Q, P ) G W^. As we show later, this costate is a natural 
choice since a direct relation exists between the symplectic forms on T*V(n, N) and W® 



n,N- 



Corollary 2. The pair (Q, P ) G satisfying equations HHty . may be expressed as 



Po = ~\ m Q + Q M - (82) 



For this solution, Pq = QM where 



M = ~Q T mQ + M (83) 



is a momentum-like quantity. 
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Proof. The proof of this statement can be carried out in two stages. In the first stage, we 
consider an extremal solution pair (Q,P) G C W% N and obtain an expression for P in 
terms of Q and M. We observe that if 

P = fiQ + QM, 

where \x G M nn is constant and M is the body momentum, then (Q,P) G W^ N . This can be 
easily verified by taking a time derivative of P along the extremal solutions given by equations 
(168)1 and (|75jl . We also observe that PQ^ = fi + m for this solution pair. This solution is in 
C W£ N if and only if 

PQ T + QP T = fj, + /i T = k, and 

PQ T - QP T = fi - /i T + 2m = m /i T - /i = m. 

This gives // uniquely as 

, = >-.»). 
and thus, (Q, P) G W 7 "^ where 

P= ^(fc-m)Q + QM. (84) 

Now applying Proposition in the second stage of this proof, we have (Q, P ) G where 
P = P — which when applied to (|54*jl gives the result expressed in equation (|52^l . As a 
result of Proposition we also know that the momenta m and M are left unchanged by this 
transformation in the costate variable. That Po = QM is easily verified by susbtituting M 
from equation and comparing with equation (JH2J. □ 
Note that, with the costate variable P = P , the solution to the optimal control problem 
satisfies 

k = QPj + P Q T = --m + -m = 0. (85) 

Also note that, for the special case of the rigid body in N dimensions (n = N), the momentum 
quantity M = |M is half the body momentum. 

This result has further important implications for the symplectic structure of W® N and its 
relation to the symplectic structure on T*V(n,N), which we will explore in the next section. 
Note that the equations ()68j l -(|69 p conserve PQ^ and QP^~ separately. In the symmetric 

rp 

representation of the rigid body equations given in Bloch et al (2002), PQ L and hence 
PQ T - QP T and PQ T + QP T are constant, and this generalizes to the extremal solution 
(in W^ N ) of the optimal control problem on the Stiefel manifold V(n, N) for 1 < n < N . 
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4-3. Correspondence of the Variational and Optimal Control Solutions 

We now give the correspondence between the variational (or Lagrangian) and optimal control 
(or Hamiltonian) respresentations of the extremal solutions to this problem. The extremal 
solutions to the variational problem can be defined in terms of the pair (Q, Q) G TV(n, N) or 
(Q, S) G T*V(n, N) and extremal solutions to the optimal control problem are defined in terms 
of the pair (Q, P ) £ N , a symplectic manifold. Here we describe a correspondence between 
them, an : T*V(n, N) -> W° N such that 2K : (Q, S) i-> (Q, P ) G W® N . 

From the solutions to the variational and optimal control problems, we know that the body 
momentum M satisfies 

M = Q T S - S T Q = Q T P - P T Q. (86) 

In addition, on pre-multiplying equation (jSfij) by Q and post-multiplying it by on both 



sides, we also get 

m = QMQ T = SQ T - QS T = P Q T - gP T , (87) 

which is derived from (jHEJ)- Equation (|8fi|) leads us to the following result. 

Corollary 3. The extremal solutions (Q, S) and (Q, Pq) to the variational and optimal control 
problems, respectively, are related by 

P = S + X (88) 

where 

X = DQ, D = D T . (89) 

This statement is a corollary of Proposition 21 and is obtained from inspection of equation 
(jHo^l . The difference Pq — S = X in equation (jHHj) obviously lies in the kernel of the linear map 
Lq : M. nN — > ~R NN defined in flrU)l) . From Proposition ^ we know that X is going to have the 
form given by equation (jgSj) . 

For the extremal solution to the optimal control problem (Q, P ) G C W® N , we know 
from Corollary El that P = — \mQ + QM . The extremal solution of the variational problem 
in Section El was given in terms of (Q,S) G T*V(n, N). The following result gives the relation 
between these two solution pairs. 

Proposition 6. The map Wl : T*V(n, N) -> W° N defined by Wl((Q,S)) = (Q,P ), is a 
diffeomorphism and Pq is given in terms of Q and S by 

P = S + DQ, (90) 



A variational problem on Stiefel manifolds 29 
where 

D = D T = l -Q[A,U]Q T =- l -{QS T +SQ T ). (91) 
The inverse of this diffeomorphism is given by OJl -1 : (Q, Pq) h- > (Q, S) where 

s = p - [(gA- 1 p T )(gA- 1 g T )- 1 - gA^g^^^g, (92) 

where Jq (P ) is given by equations ( T5IJ) with S replaced by P . 

Proof. From Corollary El we know that Pq = S + DQ where D is symmetric. Thus, we have 
S = P - DQ = -{~m + D)Q + QM, 

rp rp rp 

where M = Q S — S Q and m = QMQ . This gives us 
-(im + £>)Q + Q(Q T 5 - 5 T g) = 5 

-~(S'g T - QS T )Q - DQ = QS T Q 

^-DQ =^(QS T + SQ T )Q 

^D = ~\{QS T + SQ T ) = - ig[A, f/]g T 

since 5 = QUA, where QU is obtained from Q and 5 using Theorem El This establishes the 
relation given by equations (|9*0 J) -([9"T ]) between the variables S and P , and P is seen to be 
linearly dependent on S. Clearly, if S = then P — 0. For the converse, if Pq = then 

rp rp rp 

S = —DQ. Since D is symmetric, we have Q S = S Q = —Q DQ. Since S = QUA, we 
have Q^ S — S^Q = Jq(U), as defined in equation From Lemma 01 we know that if 

J Q (U) = 0, then QU = 0. Thus, when P Q = 0, we have J Q (U) = and hence S = QUA = 0. 
Thus 2R((Q,S)) = (Q,Pq) as given by (fHUJ) is a diffeomorphism. For the inverse of this map, 
it is clear from Theorem 121 and its proof that U can be expressed in terms of Q and Pq in the 
same manner that it is expressed in terms of Q and S in that theorem. This is true because 

rp rp 

that representation is based on the decomposition M = Q S — S Q, which has the same form 
as M = Q^-Po — PqQ- Hence, we also have a diffeomorphism Zq : W^ N — > TV(n, N) defined 
by Z ((Q, Pq)) = (Q, [U]) = (Q, [U x + U 2 \) as follows 

U x = A- 1 (Q T (QA- 1 Q T )- 1 P - P T (QA- 1 Q T )- 1 g)A- 1 , 

u 2 = a-\q t Jq'\r )q)a- 1 , 

where 

r = (gA- 1 p T )(gA- 1 g T )- 1 - (gA- 1 g T )- 1 (p A- 1 g T ). 

Thus the inverse of DJt is given by = Z^ 1 o Zq, where Z is as defined in Corollary^ From 
this corollary, we also know that S = Q(U\ + U 2 )A uniquely determines S, and this gives us 
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the relation (|92|) for S in terms of Pq and Q. 



30 
□ 



Note that if A = pv, the N x N identity matrix, then S = Pq. Also note that 
D = (gA- 1 P T )(gA- 1 g T )- 1 - QA- 1 Q T J q '\r ) in flH|) is a symmetric matrix. This can 
be shown using equations of Theorem as follows 



D-D 



T 



(gA- 1 p T )(gA- 1 g 



QA^qTJq '{Ro)- J q \r )qa- 1 q 



-i^>T 



-(gA^gTr^QA^pT) 



(gA- 1 p T )(gA- 1 g T )- 1 

-Ro — Jq(X ) = Rq — Rq 



QA~ 1 Q T X - X QA- l Q T 



0. 



(gA- 1 g T )- 1 (gA~ 1 p T ) 



where Xo = Jq (Po)- This expresses the symmetric matrix D in terms of Q and Po, while P 
is also given as a function of Q and 5* by equation (JHU). 

5. The tangent and cotangent bundles of the Stiefel Manifold 

In this section, we explore the structure of the tangent and cotangent bundles of the Stiefel 
manifold, which is the homogeneous space V(n, N) = SO(N)/SO(N — n). The Stiefel manifold 
is parametrized by Q e M. nN such that QQ^ = I n . Consider the point go = [I n 0] on V(n, N), 
where denotes the (N—n)xn matrix of zeros. SO(N) acts on go on the right as 1Z : go i— > goP 
where P G SO(N), and the isotropy group is the subgroup 

P 



P G SO(N) | P 



P 



P G SO(N-n) 



This gives the dimension of V(n, AT) as the difference of the dimensions of SO(N) and SO(N-n) 
dim V(n, N) = AT (AT - l)/2 - (JV - n)(JV - n - l)/2 = nN - n(n + l)/2. 



5. i. Symplectic Structure of the cotangent bundle of the Stiefel Manifold 

The tangent space to V(n, A") at g is parametrized by QU, U G so (AT). However, P is unique 
only upto the equivalence class [U] defined by equation If g = [J n 0] as before, then the 
set V G SO (AQ such that QV = is given by 





V G SO(N) | V 



V 



V G so(N - n) 



This determines TqV(u, N) as the vector space 
so(AO/so(AT-n). 
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We occasionally refer to elements of TqV(n, N) as the equivalence classes Q[U], where [U] is 
as defined by equation (J25J) . 



We may parametrize the co-tangent space TqV(u, N) by 

?6T*%iV), P = 2(QM,-), Meso(N). 

Let X G TqV(n, N), hence X = QU, U G so(iV). Then we have 

P(X) = 2{QM, QU) = 2(Q T QM, U). 

In the case n = N, this pairing gives the Killing form on so(N), which is non-degenerate. The 
following result shows that in the equivalence classes [•] defined on so(N), this pairing between 
TqV(h, N) and TXV(n, N) is non- degenerate. 

Lemma 6. The pairing O : T^V{n,N) x T Q V{n,N) -> R given by D(Q[M],Q[U]) i-> 
(Q[M],Q[U]) is non- degenerate. 

Proof. For this pairing to be non-degenerate on TqV(n, N), we should have 

(QM, QU) = V U Q[M}= 0. 
Evaluating the pairing on the left hand side gives us 

(QM, QU) = (Q T QM, U) = V U G so{N) 

Q T QM + MQ T Q = 0. 

Now, by Lemma El the above equality is satisfied if and only if QM = (substitut- 
ing U = M and A = Jjv in Lemma EJ). Thus, we have shown that the bilinear pairing 
0(Q[M], Q[U}) ^ (Q[M\, Q[U}) is non-degenerate on T Q V(n, N). □ 
Hence, this representation of TqV(u, N) is well defined. 

Identifying T*V(n, N) and TV(n, N) by this pairing, we may parametrize TT*V(n, N) by 
vectors (Q, QM, QU, QUM + QZ), where M,U,Z G so(N). Viewing the co-tangent bundle 
T*V(n, N) as a subset of R nN x R nN , we may pull back the symplectic form on R nN x R nN 
given by 

u((A!, 50, (A 2 , B 2 )) = (B 2 , A,) - (B u A 2 ), 
to T*V(n, N) via the inclusion map. If 

X, = (Q, QM, QU U QU X M + QZ X ), X 2 = (Q, QM, QU 2 , QU 2 M + QZ 2 ) G T {Q>QM) T*V(n, N), 

then the two form we obtain on T*V(n, N) is 

uj^qm^X,) = (QUzM + QZ^QUj - {QU X M + QZ U QU 2 ) 

= (Q T QU 1 , Z 2 ) - (Q T QU 2 , Z x ) + (M, U X Q T QU 2 - U 2 Q T QU X ). (93) 
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It is simple to check using Lemma El that this is indeed nondegenerate and hence a symplectic 
form. This expression for the symplectic form can also be obtained using the canonical structure 
on T*V(n, N). In general, for a smooth manifold M and using the projection 7r : T*M — > M, 
we define a one form 9 on T*M by 

6 p (X) = pn*X, 

where p G T*M and X is a vector field on T*M. We define the canonical symplectic form u c 
on T*M by setting u c = —d9, where d9 is the exterior derivative of the one form 9. We may 
simplify this expression using the identity (given in Bloch et al (2003)) 

d9(X, Y) = X(9(Y)) - Y(9(X)) - 9([X, Y]), 

so 

u B (X, Y) = Y(9(X)) - X(9(Y)) + 9([X, Y]). (94) 

We apply equation to the case where the manifold M = V(n,N). We parametrize 
T*V(n, N) by pairs (Q,M), Q G V(n,N) and M G so(N). Then we can write elements of 
T {Q ,M)T*V(n, N) at the point (Q, M) as 

Xk = (QUk,Z k ), k = 1,2. 

Hence, if ir is the projection 7r : TT*V(n, N) — > TV(n, N), then 7r^Xfc = QUk, and if 
p G T*V(n, N),p= (QM, •), then 

9(X k )= P 7^X k = (QM,QU k ). 
Thus, from equation (jHlj) we have 

u) e {X u X 2 ) = X 2 {QM,QU 1 )-X X {QM,QU 2 ) + (QM,7r*[X u X 2 ]}. (95) 

The last term in the above equation can be simplified as follows 

7r it [Xi,X 2 ] = [7r*Xi, 7r^X 2 ] o vr 

= [Qt/i,QC/2] (as vector fields) 
= QU X U 2 - QU 2 Ut 
= Q[Ux,U 2 ]. 

The first two terms are given by 

X 2 (QM,QU 1 ) = (QU 2 ,Z 2 )(QM,QU 1 ) = (QZ^QU^ + (QU*M,QU X ) + (QM,QU 2 U X ), 
and 

X X {QM,QU 2 ) = (QU U Z 1 )(QM 1 QU 2 ) = (QZ 1 ,QU 2 ) + (QU X M,QU 2 ) + (QM,QU X U 2 ). 
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Thus, we obtain 

u c (X 1 ,X 2 ) = (QZ 2 ,QU X ) - (QZ U QU 2 ) - (M,U 2 Q T QU X ) + (M,U 1 Q T QU 2 ) 
+ (M, Q T Q(U 2 U X - U X U 2 )) + (QM, Q[U h U 2 \) 
= (QZ 2 , QUt) - (QZ 1 , QU 2 ) + (M, U X Q T QU 2 - U 2 Q T QU X ), (96) 

which is identical to equation (|93j) obtained from restricting the two-form on ¥L nN x M. nN to 
T*V(n,N). 

We now have a formula for the natural symplectic form on T*V(n, N), and know that it can 
be formulated as the restriction of the symplectic form on the product R n7V x ~R nN . We want 
to use this formula to recover the Hamiltonian flow corresponding to the geodesic problem 
(gj) on V(n,N). The Hamiltonian is given by © as H(P,Q,U) = (P,QU) - (QU,QUA) 
and the optimal control U* is given by the maximum principle as in (|74|). We replace P by 
the parametrization QM, where we expect M to be the momentum. Thus, we can write the 
Hamiltonian as 

H{M, Q,U) = (QM, QU) - ~ (QUA, QU) . (97) 
The Hamiltonian flow on T*V(n, N) is obtained from the solution of 

dPi(M,Q,u)X 2 = co c (Xi, X 2 ), V X 2 = (QU 2 , Z 2 ), 

where X\ = Xh is the Hamiltonian vector field corresponding to H. We calculate 

dH(X 2 ) = (QU 2 M,QU) + (QM,QU 2 U) + (QZ 2 ,QU) - ^(QU 2 UA,QU) - l -(QUA,QU 2 U), 

and this is equated to uj c (Xh, X 2 ) in (pjj). From equating these expressions after replacing Z\ 
and U\ by Z and U , respectively, in ([96|). we get 

(QU 2 , QZ) = (QU 2 , QMU) - (QU 2 , QUM) - (QU 2 , QUAU). (98) 

Note that (QU 2 Q T ,QUAUQ T ) = since QU 2 Q T is skew-symmetric and QUAUQ T is 
symmetric. Therefore, (QU 2 ,QU AUQ Q) = and we can express the last term in equation 
Pty as 

(QU 2 ,QUAU) = (QU 2 ,Q(Q T QUAU - UAUQ T Q)). 
If F is skew-symmetric and (QU 2 ,QUAU) = (QU 2 ,QF), then 

(QU 2 , Q(F - [Q T Q, UAU})) = (QU 2 , QUAUQ T Q) = V U 2 . 
Thus F = [Q T Q, UAU] up to an equivalence class. Hence, from ()98|) we get 

(QU 2 , Q(Zt - [M, U] + [Q T Q, UAU})) =0VU 2 

=>Z X = [M, U] - [Q T Q,UAU], 
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up to equivalence class. It follows that the geodesies on T*V(n, N) with the metric in (J1J are 
given by the Hamiltonian flow 

Q =QU, 

M = [M, U] — A, A=[Q T Q,UAU], 
as given previously by (pjj) and (|75)i. 

Symplectomorphism between the cotangent bundle and W® N 

Finally we explore the relation between the symplectic structures on the cotangent bundle 
T*V(n,N) and W® N . Consider the Hamiltonian vector fields defined by (J58)l - (J59Jl : 

X l = (QU 1 ,P U 1 -QA 1 ), 
X 2 = (QU 2 ,P U 2 -QA 2 ), 

where A\ = [Q Q, L^AC/j] , i — 1,2. The symplectic form on W° N evaluated along these 
vector fields on W® N is obtained using equation (fTTj) as: 

n (QiPo) (X 1: X 2 ) = (P U 2 - QA^QUt) - (PoUi - QA U QU 2 ). (99) 

From Corollary H we know that if (Q, P ) G C W° >JV , then 

771 — 

Po = — 2 Q + QM = QM, 

where M is given by (|H3*j). 

Now we show the following relationship between T*V(n, N) and W® N . 

Theorem 5. The map $ : (SI, W^ N ) -> (w c , T*V(n, iV)), #raera &y (Q,P ) ^ (Q,QM) where 

M = -±Q T (P Q T -QP T )Q + Q T P - P T Q, (100) 

is a symplectomorphism. 

Proof: Coordinates for TqV(n, N) are given by (Q,QM). Consider the Hamiltonian vector 
fields on T*V(n, N) corresponding to the extremal flows for the optimal control problem 
considered in the last section 

Xi = d 1 (Q,QM) = (QU 1 ,QU 1 M + QZ 1 ), 
X2 = d 2 (Q, QM) = (QU 2 , QU 2 M + QZ 2 ), 

where 

Z 1 = d 1 M=[M,U 1 ]-A 1 , 
Z 2 = d 2 M = [M,U 2 ]-A 2 . 
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We show that the push-forwards of the Hamiltonian vector fields X{ on W® N defined earlier, 
give the Hamiltonian vector fields Xi on T*V(n, N). We have 

($^)($(Q,P )) = (T (QiPo) $-X 1 )($(g,Po)) 

= (QU i ,QMU i -QA i ) 

= (QUi, QU, t M + Q[M, Ui] - QAi) 

= (QUi, QUiM + QZi) = X i(Q, QM) 
<£> $*X; = Xi- 

The canonical symplectic form on T*V(n, N) is evaluated as 

^ c (q,qm)(Xi, X2) = (QU 2 M + QZ 2 , QU X ) - (QUtM + QZ 1 , QU 2 ) 

= (QU 2 M + Q[M, U 2 ) - QA 2 , QU X ) - (QU X M + Q[M, U x ] - QA U QU 2 ) 

= (QMU 2 - QA 2 , QU X ) - (QMU X - QA U QU 2 ). (101) 

The pull-back of this symplectic form from T*V(n, N) to W® N gives us the symplectic form on 
W® N , as shown below: 

( $ *^) (QiPo) (X X ,X 2 ) = U c ( QiQ M}($Jl^A) = (Q,QM)(Xl,X2) 

= (QMU 2 - QA 2 , QUi) - (QMUi - QA U QU 2 ) 

= (PoU 2 - QA 2 ,QU X ) - (P Q Ui - QAi, QU 2 ) = n iQ>Po) (X 1: X 2 ). 

However, from Lemma we know that the vector fields Xi form a Lie algebra that spans the 
tangent space to W® N at every point (Q,Po). Hence, we have 

$*u; c = a (102) 

The map $ : W^ N -> T*V(n, N) given by (Q, P ) i-> (Q, QM) where M is given by (HUUJ), has 
an inverse which is simply given by <3? _1 : (Q, QM) i— > (Q, Pq) where Pq = QM. The inverse 
map $ _1 is clearly a diffeomorphism. Thus, the map $ is a diffeomorphism that also maps the 
symplectic form in W® N to the symplectic form in T*V(n, N), and is hence a symplectomor- 
phism. □ 

With M expressed in terms of Q and Pq as in this theorem, we have that QM = P Q where 
(Q,P ) e W® N . Hence, Theorem |S] shows that the solution space W® N for the optimal control 
problem can be identified with the cotangent bundle of the Stiefel manifold. The Hamiltonian 
formulation of the optimal control problem parametrized by (Q, Pq) or (Q, QM) is related to the 
variational formulation of this problem parametrized by (Q,Q) or (Q, S), via the relationship 
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established in Proposition |U1 Combining Theorem |S] with Proposition El we get the following 
expression for M in terms of (Q,S): 

M = Q T S -S T Q, S = S(I N -^Q T Q). (103) 

Of course, in the case case A = In, we have = S and these solutions are directly related. 

6. Applications and Open Problems 

Nowadays numerical linear algebra computations and numerical integration of ODEs are in- 
creasingly based on variational problems on manifolds. Problems on Stiefel (and Grassman) 
manifolds are finding increasing use in numerical linear algebra applications (see, for exam- 
ple, Edelman et al (1998), Elden and Park (1999), and references therein). The first of these 
papers develops Newton and conjugate gradient methods on these manifolds, while the latter 
paper deals with a problem related to regression analysis in psychometrics. Another appli- 
cation of numerical calculations on Stiefel manifolds is in computing Lyapunov exponents for 
finite-dimensional dynamical systems by time integration. The Lyapunov exponents are com- 
puted by a continuous orthonormalization (which is essential for stable numerical integration) 
of a set of solution vectors of the linearized system. This amounts to restricting the linearized 
system to the Stiefel manifold V(k, N) for computing the k largest Lyapunov exponents of an 
N- dimensional system. A sample of the literature on this topic can be obtained from Bridges 
and Reich (2001) and references therein. 

It is known that the geodesic flow on the Stiefel manifold V(n, N) with a left-invariant 
metric is integrable for the extreme cases: when n = 1, which represents the geodesic flow 
on the sphere/ellipsoid; and when n = N, which represents the A^-dimensional rigid body 
on SO(N). Bolsinov and Jovanovic have shown that the extremal flows on Stiefel manifolds 
and other homogeneous spaces with bi-invariant metrics are integrable. There are two sets of 
integrals for such flows. The first set of integrals are the Noether integrals 

T x = {h o (25, (5 : TV(n, N) -> so*(N), h : so*(N) -> E}, 

where <3(Q, QU) = Jq{U) = M. In this case, it is easy to verify that M is conserved along the 
extremal flows. Without loss of generality, we may take A = 1^ as the bi-invariant metric, and 
the extremal flows are then given by 

Q = QU, Q = BQ, B = B T . 

The body momentum is M = Q QU + UQ Q, and its derivative along the extremal flows is 

M = Q T Q + Q T Q - Q T Q - Q T Q = 0. 
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This is a generalization of the case of the symmetric rigid body, where A is a scalar multiple of 
the identity matrix. The second set of integrals for this bi-invariant case is 

JF 2 = all SO(A/")-invariant functions on TV(n, N). 

The most important issue for future research is the integrability of the extremal solutions in 
the general case (1 < n < N) with a left-invariant metric. Another research issue of considerable 
interest for numerical applications is discretization of the optimal control problem (@J) based on 
the maximum principle, to obtain the corresponding discrete extremal flow in the states and 
costates. Treatment in more detail of special cases like the rank 2 case (n = 2) could also be 
carried out in the future. 

7. Conclusions 

In this paper we have presented the (continuous) geodesic flow on Stiefel manifolds with left- 
invariant metrics. The geodesic equations were obtained from two approaches; a variational 
approach taking reduced variations on the Stiefel manifold, and an optimal control approach 
using costate variables. We have attempted to generalize the symmetric representation of the 
A^-dimensional rigid body flow given by Bloch et al (2002) to geodesic flows on Stiefel manifolds. 

The solution manifold in the symmetric representation of the rigid body was SO(N) x 
SO(N). We found it easier to restrict the geodesic flows on Stiefel manifolds with a left- 
invariant metric to the W* N , which are symplectic submanifolds of M. nN x M. nN . The relation 
between SO(N) x SO(N) and N for any fixed value of k is given by: 

SO(N) x SO(N) = |J W£, 

me{m' | >^-eSO(N)} 

where the union is over all such m € so(N) that satisfy G SO(N). We obtained the 
geodesic flows using the maximum principle of optimal control theory and related them to 
the Hamiltonian flows using the natural symplectic structure on the Stiefel manifold. We also 
related this optimal control formulation to the form naturally derived from variational calculus. 

Note that the extremal solutions of the optimal control problem in W® N are a 
generalization of the McLachlan-Scovel equations (jSJ) for the A^-dimensional rigid body, where 

rp 

Q Pq is skew symmetric. However, these extremal solutions do not generalize the symmetric 

rp 

representation of the rigid body equations given in Bloch et al (2002), wherein Q P is 
orthogonal. 
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